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HT1220 Report 



RECORD INFORMATION 



Gene ID: 
Sequence ID: 
Protein ID: 
Sequence name; 
Genome : 
Taxon: 
Locus : 

Common Name : 
Role ID: 



1220 
1220 
1220 

thrombospondin 
nucleus 
Homo sapiens 
1220 

thrombospondin 
40 



1, alt. transcript 1 



Coding sequence length: 3 513 nt 

Transcript sequence length: 5722 nt 
Expression data: THC2 01673 



ACCESSION DATA 

HT1220 is derived from access ions(s): 

SP:P07996 (THROMBOSPONDIN 1 PRECURSOR.) 

GB :X04 66 5 (Human mRNA for thrombospondin) 

GB :X147 3 7 (Human mRNA for thrombospondin) 

GB :U12471 ( thrombospondin-pS 0 {Homo sapiens}) 

GB:M99425 (Human thrombospondin mRNA, 3 1 end,) 

PIR:G01478 ( thrombospondin -p50 - human (fragment) ) 

GB :U12471 (Human thrombospondin- 1 gene, partial cds . ) 

GB:J04835 (Human thrombospondin gene, exons'l, 2 and 3-) 

GB : M2 563 1 (Homo sapiens (clone lambda-TS-33 ) thrombospondin (THBS) mRNA, 5' 



end - ) 



ALTERNATIVE SPLICE INFORMATION 
Alternative splice forms for this gene: 

HT3 98 7 thrombospondin 1, alt. transcript 2 



MAPPLNG DATA 

GDB accession(s) for this gene: 

GDB ID: Symbol 
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qdb : 120433 THBS1 



cDNA FEATURES 



Feature 



End 5 End 3 



coding_seq 



112 

3625 

1235 



3624 
5722 
1236 



3 'UT 

spjunc_h 



SEQUENCE 



nucleotide: 

ggacgcacaggcattcccGgcgcccctccagccctcgccgccctcgccaccgctcccggc 
cgccgcgctccggtacacacaggatccctgctgggcaccaacagctccaccatggggctg 
gcctggggactaggcgtcctgttcctgatgcatgtgtgtggcaccaaccgcattccagag 
tctggcggagacaacagcgtgtttgacatctttgaactcaccggggccgcccgcaagggg 
tctgggcgccgactggtgaagggccccgacccttccagcccagctttccgcatcgaggat 
gccaacctgatcccccctgtgcctgatgacaagttccaagacctggtggatgctgtgcgg 
gcagaaaagggtttcctccttctggcatccctgaggcagatgaagaagacccggggcacg 
ctgctggccctggagcggaaagaccactctggccaggtcttcagcgtggtgtccaatggc 
aaggcgggcaccctggacctcagcctgaccgtccaaggaaagcagcacgtggtgtctgtg 
gaagaagctctcctggcaaccggccagtggaagagcatcaccctgtttgtgcaggaagac 
agggcccagctgtacatcgactgtgaaaagatggagaatgctgagttggacgtccccatc 
caaagcgtcttcaccagagacctggccagcatcgccagactccgcatcgcaaaggggggc 
gtcaatgacaatttccagggggtgctgcagaatgtgaggtttgtctttggaaccacacca 
gaagacatcctcaggaacaaaggctgctccagctctaccagtgtcctcctcacccttgac 
aacaacgtggtgaatggttccagccctgccatccgcactaactacattggccacaagaca 
aaggacttgcaagccatctgcggcatctcctgtgatgagctgtccagcatggtcctggaa 
ctcaggggcctgcgcaccat.tgtgaccacgctgcaggacagcatccgcaaagtgactgaa 
gagaacaaagagttggccaatgagctgaggcggcctcccctatgctatcacaacggagtt 
cagtacagaaataacgaggaatggactgttgatagctgcactgagtgtcactgtcagaac 
tcagttaccatctgcaaaaaggtgtcctgccccatcatgccctgctccaatgccacagtt 
cctgatggagaatgctgtcctcgctgttggcccagcgactctgcggacgatggctggtct 
ccatggtccgagtggacctcctgttctacgagctgtggcaatggaattcagcagcgcggc 
cgctcctgcgatagcctcaacaaccgatgtgagggctcctcggtccagacacggacctgc 
cacattcaggagtgtgacaaaagatttaaacaggatggtggctggagccactggtccccg 
tggtcatcttgttctgtgacatgtggtgatggtgtgatcacaaggatccggctctgcaac 
tctcccagcccccagatgaatgggaaaccctgtgaaggcgaagcgcgggagaccaaagcc 
tgcaagaaagacgcctgccccatcaatggaggctggggtccttggtcaccatgggacatc 
tgttctgtcacctgtggaggaggggtacagaaacgtagtcgtctctgcaacaaccccgca 
ccccagtttggaggcaaggactgcgttggtgatgtaacagaaaaccagatctgcaacaag 
caggactgtccaattgatggatgcctgtccaatccctgctttgccggcgtgaagtgtact 
agctaccctgatggcagctggaaatgtggtgcttgtccccctggttacagtggaaatggc 
atccagtgcacagatgttgatgagtgcaaagaagtgcctgatgcctgcttcaaccacaat 
ggagagcaccggtgtgagaacacggaccccggctacaactgcctgccctgccccccacgc 
ttcaccggctcaGagcccttcggccagggtgtcgaacatgccacggccaacaaacaggtg 
tgcaagccccgtaacccctgcacggatgggacccacgactgcaacaagaacgccaagtgc 
aactacctgggccactatagcgaccccatgtaccgctgcgagtgcaagcctggctacgct 
ggcaatggcatcatctgcggggaggacacagacctggatggctggcccaatgagaacctg 
gtgtgcgtggccaatgcgacttaccactgcaaaaaggataattgccccaaccttcccaac 
tcagggcaggaagactatgacaaggatggaattggtgatgcctgtgatgatgacgatgac 
aatgataaaattccagatgacagggacaactg'tccattccattacaacccagctcagtat 
gactatgacagagatgatgtgggagaccgctgtgacaactgtccctacaaccacaaccca 
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gatcaggcagacacagacaacaatggggaaggagacgcctgtgctgcagacattgatgga 
gacggtatcctcaatgaacgggacaactgccagtacgtctacaatgtggaccagagagac 
actgatatggatggggttggagatcagtgtgacaattgccccttggaacacaatccggat 
cagctggactctgactcagaccgcattggagatacctgtgacaacaatcaggatattgat 
gaagatggccaccagaacaatctggacaactgtccctatgtgcccaatgccaaccaggct 
gaccatgacaaagatggcaagggagatgcctgtgaccacgatgatgacaacgatggcatt 
cctgatgacaaggacaactgcagactcgtgcccaatcccgaccagaaggactctgacggc 
gatggtcgaggtgatgcctgcaaagatgattttgaccatgacagtgtgccagacatcgat 
gacatctgtcctgagaatgttgacatcagtgagaccgatttccgccgattccagatgatt 
cctctggaccccaaagggacatcccaaaatgaccctaactgggttgtacgccatcagggt 
aaagaactcgtccagactgtcaactgtgatcctggactcgctgtaggttatgatgagttt 
aatgctgtggacttcagtggcaccttcttcatcaacaccgaaagggacgatgactatgct 
ggatttgtctttggctaccagtccagcagccgcttttatgttgtgatgtggaagcaagtc 
acccagtcctactgggacaccaaccccacgagggctcagggatactcgggcctttctgtg 
aaagttgtaaactccaccacagggcctggcgagcacctgcggaacgccctgtggcacaca 
ggaaacacccctggccaggtgcgcaccctgtggcatgaccctcgtcacataggctggaaa 
gatttcaccgcctacagatggcgtctcagccacaggccaaagacgggtttcattagagtg 
gtgatgtatgaagggaagaaaatcatggctgactcaggacccatctatgataaaacctat 
gctggtggtagactagggttgtttgtcttctctcaagaaatggtgttcttctctgacctg 
aaatacgaatgtagagatccctaatcatcaaattgttgattgaaagactgatcataaacc 
aatgctggtattgcaccttctggaactatgggcttgagaaaacccccaggatcacttctc 
cttggcttccttcttttctgtgcttgcatcagtgtggactcctagaacgtgcgacctgcc 
tcaagaaaatgcagttttcaaaaacagactcatcagcattcagcctccaatgaataagac 
atcttccaagcatataaacaattgctttggtttccttttgaaaaagcatctacttgcttc 
agttgggaaggtgcccattccactctgcctttgtcacagagcagggtgctattgtgaggc 
catctctgagcagtggactcaaaagcattttcaggcatgtcagagaagggaggactcact 
agaattagcaaacaaaaccaccctgacatcctccttcaggaacacggggagcagaggcca 
aagcactaaggggagggcgcatacccgagacgattgtatgaagaaaatatggaggaactg 
ttacatgttcggtactaagtcattttcaggggattgaaagactattgctggatttcatga 
tgctgactggcgttagctgattaacccatgtaaataggcacttaaatagaagcaggaaag 
ggagacaaagactggcttctggacttcctccctgatccccacccttaetcatcaccttgc 
agtggccagaattagggaatcagaatcaaaccagtgtaaggcagtgctggctgccattgc 
ctggtcacattgaaattggtggcttcattctagatgtagcttgtgcagatgtagcaggaa 
aataggaaaacctaccatctcagtgagcaccagctgcctcccaaaggaggggcagccgtg 
cttatatttttatggttacaatggcacaaaattattatcaacctaactaaaacattcctt 
ttctcttttttccgtaattactaggtagttttctaattctctcttttggaagtatgattt 
ttttaaagtctttacgatgtaaaatatttattttttacttattctggaagatctggctga 
aggattattcatggaacaggaagaagcgtaaagactatccatgtcatctttgttgagagt 
cttcgtgactgtaagattgtaaatacagattatttattaactctgttctgcctggaaatt 
taggct teat acggaaagtgtttgagagcaagtagttgacatt tat cagcaaatctcttg 
caagaacagcacaaggaaaatcagtctaataagctgctctgccccttgtgctcagagtgg 
atgttatgggattccttttttctctgttttatcttttcaagtggaattagttggttatcc 
atttgcaaatgttttaaattgcaaagaaagccatgaggtcttcaatactgttttacccca 
tcccttgtgcatatttccagggagaaggaaagcatatacacttttttctttcatttttcc 
aaaagagaaaaaaatgacaaaaggtgaaacttacatacaaatattacctcatttgttgtg 
tgactgagtaaagaatttttggatcaagcggaaagagtttaagtgtctaacaaacttaaa 
gctactgtagtacctaaaaagtcagtgttgtacatagcataaaaactctgcagagaagta 
ttcccaataaggaaatagcattgaaatgttaaatacaatttctgaaagttatgttttttt 
tctatcatctggtataccattgctttatttttataaattattttctcattgccattggaa 
tagaatattcagattgtgtagatatgctatttaaataatttatcaggaaatactgcctgt 
agagttagtatttctatttttatataatgtttgcacactgaattgaagaattgttggttt 
tttcttttttttgttttttttttttttttttttttttttgcttttgacctcccattttta 
ctatttgccaatacctttttctaggaatgtgcttttttttgtacacatttttatccattt 
tacattctaaagcagtgtaagttgtatattactgtttcttatgtacaaggaacaacaata 
aatcatatggaaatttatattt 



MGLAWGLGVLFLMHVCGTNRI PESGGDNSVFD I FELTGAARKGSGRRLVKGPD PSS PAFR 



protein: 
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I ED ANL I P P VPDD KFQDLVDAVRAEKGFIiLLAS LRQMKKTRGTLLALERKDHS GQVF S W 
SNGKAGTLDLSLTVQGKQHWSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELD 
VP IQS VFTRDLAS IARLRI AKGGWDNFQGVLQNVRFVFGTTPED ILRNKGCS S STS VLL 
TLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRK 
VTEENKELANELRRPPLCYHNGVQYRN^ 

ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLWNRCEGSSVQT 
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE 
TKACKKDACP INGGWGPWS PWD ICS VTCGGGVQKRSRLCNNPAPQFGGKDCVGDVTENQ I 
CNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACF 
NHNGEHRCENTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKN 
AKCNYLGHYSD PWYRCECKPGYAGNGI I CGEDTDLDGWPNENLVCVANATYHCKKDNCPN 
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN 
HNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVY^ 

NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN 
DGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRF 
QMIPLDPKGTSQNDPNWWRHQGKELVQTV^ 

DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL 
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD 
KTYAGGRLGLFVFSQEMVFFSDIiKYECRDP 
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HT2143 Report 



RECORD INFORMATION 



Gene ID : 
Sequence ID: 
Protein ID: 
Sequence name: 
Genome : 
Taxon : 
Locus : 

Common Name : 
Role ID: 



2081 
2143 
2125 

thrombospondin 4 
nucleus 
Homo sapiens 
2081 

thrombospondin 4 
40 



Coding sequence length: 2 8 86 nt 

Transcript sequence length : 3 0 74 nt 
Expression data: THC168897 



ACCESSION DATA 

HT2143 is derived from accessions(s): 

SP:P35443 (THROMBOSPONDIN 4 PRECURSOR . ) 
GB:Z19585 (thrombospondin- 4 {Homo sapiens}) 
GB:Z195 85 (H. sapiens mRNA for thrombospondin-4 ) 
PIR:A55710 (thrombospondin 4 precursor - human) 



cDNA FEATURES 



Feature 



End 5 End 3 



coding_seq 
3 'UT 



28 

2914 



2913 
3074 



SEQUENCE 
nucleotide: 



gaattccggggagcaggaagagccaacatgctggccccgcgcggagccgccgtcctcctg 
ctgcacctggtcctgcagcggtggctagcggcaggcgcccaggccaccccccaggtcttt 
gaccttctcccatcttccagtcagaggctaaacccaggcgctctgctgccagtcctgaca 
gaccccgccctgaatgatctctatgtgatttccaccttcaagctgcagactaaaagttca 
gccaccatcttcggtctttactcttcaactgacaacagtaaatattttgaatttactgtg 
atgggacgcttaagcaaagccatcctccgttacctgaagaacgatgggaaggtgcatttg 
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gtggttttcaacaacctgcagctggcagacggaaggcggcacaggatcctcctgaggctg 
agcaatttgcagcgaggggccggctccctagagctctacctggactgcatccaggtggat 
tccgttcacaatctccccagggcctttgctggcccctcccagaaacctgagaccattgaa 
ttgaggactttccagaggaagccacaggacttcttggaagagctgaagctggtggtgaga 
ggctcactgttccaggtggccagcctgcaagactgcttcctgcagcagagtgagccactg 
gctgccacaggcacaggggactttaaccggcagttcttgggtcaaatgacacaattaaac 
caactcctgggagaggtgaaggaccttctgagacagcaggttaaggaaacatcatttttg 
cgaaacaccatagctgaatgccaggcttgcggtcctctcaagtttcagtctccgacccca 
agcacggtggtcgccccggctccccctgcaccgccaacacgcccacctcgtcggtgtgac 
tccaacccatgtttccgaggtgtccaatgtaccgacagtagagatggcttccagtgtggg 
ccctgccccgagggctacacaggaaacgggatcacctgtattgatgttgatgagtgcaaa 
taccatccctgctacccgggcgtgcactgcataaatttgtctcctggcttcagatgtgac 
gcctgcccagtgggcttcacagggcccatggtgcagggtgttgggatcagttttgccaag 
tcaaacaagcaggtctgcactgacattgatgagtgtcgaaatggagcgtgcgttcccaac 
tcgatctgcgttaatactttgggatcttaccgctgtgggccttgtaagccggggtatact 
ggtgatcagataaggggatgcaaagtggaaagaaactgcagaaacccagagctgaaccct 
tgcagtgtgaatgcccagtgcattgaagagaggcagggggatgtgacatgtgtgtgtgga 
gtcggttgggctggagatggctatatctgtggaaaggatgtggacatcgacagttacccc 
gacgaagaactgccatgctctgccaggaactgtaaaaaggacaactgcaaatatgtgcca 
aattctggccaagaagatgcagacagagatggcattggcgacgcttgtgacgaggatgct 
gacggagatgggatcctgaatgagcaggataactgtgtcctgattcataatgtggaccaa 
aggaacagcgataaagatatctttggggatgcctgtgataactgcctgagtgtcttaaat 
aacgaccagaaagacaccgatggggatggaagaggagatgcctgtgatgatgacatggat 
ggagatggaataaaaaacattctggacaactgcccaaaatttcccaatcgtgaccaacgg 
gacaaggatggtgatggtgtgggggatgcctgtgacagttgtcctgatgtcagcaaccct 
aaccagtctgatgtggataatgatctggttggggactcctgtgacaccaatcaggacagt 
gatggagatgggcaccaggacagcacagacaactgccccaccgtcattaacagtgcccag 
ctggacaccgataaggatggaattggtgacgagtgtgatgatgatgatgacaatgatggt 
atcccagacctggtgccccctggaccagacaactgccggctggtccccaacccagcccag 
gaggatagcaacagcgacggagtgggagacatctgtgagtctgactttgaccaggaccag 
gtcatcgatcggatcgacgtctgcccagagaacgcagaggtcaccctgaccgacttcagg 
gcttaccagaccgtgggcctggatcctgaaggggatgcccagatcgatcccaactgggtg 
gtcctgaaccagggcatggagattgtacagaccatgaacagtgatcctggcctggcagtg 
gggtacacagcttttaatggagttgacttcgaagggaccttccatgtgaatacccagaca 
gatgatgactatgcaggctttatctttggctaccaagatagctccagcttctacgtggtc 
atgtggaagcagacggagcagacatattggcaagccaccccattccgagcagttgcagaa 
cctggcattcagctcaaggctgtgaagtctaagacaggtccaggggagcatctccggaac 
tccctgtggcacacgggggacaccagtgaccaggtcaggctgctgtggaaggactccagg 
aatgtgggctggaaggacaaggtgtcctaccgctggttcctacagcacaggccccaggtg 
ggctacatcagggtacgattttatgaaggctctgagttggtggctgactctggcgtcacc 
atagacaccacaatgcgtggaggccgacttggcgttttctgcttctctcaagaaaacatc 
atctggtccaacctcaagtatcgctgcaatgacaccatccctgaggacttccaagagttt 
caaacccagaatttcgaccgcttcgataattaaaccaaggaagcaatctgtaactgcttt 
tcggaacactaaaaccatatatattttaacttcaattttctttagcttttaccaacccaa 
atatatcaaaacgttttatgtgaatgtggcaataaaggagaagagatcatttttaaaaaa 
aaaaaaaaaaaaaa 



MLAPRGAAVLLLHLVLQRWLAAGAQATPQVFDLLPSSSQRLNPGALLPVLTDPALNDLYV 
I S TFKLQTKS S AT I FGLYS STDNS KYFEFTVMGRLS KAI LRYLKNDGKVHL WFNNLQLA 
DGRRHRILLRLSNLQRGAGSLELYLDCIQVDSVHNLPRAFAGPSQKPETIELRTFQRKPQ 
DFLEELKLVWGSLFQVASLQDCFLQQSEPLAATGTGDFNRQFLGQMTQLNQLLGEVKDL 
LRQQVKETSFLRNTIAECQACGPLKFQSPTPSTWAPAPPAPPTRPPRRCDSNPCFRGVQ 
CTDSRDGFQCGPCPEGYTGNGITCIDVDECKYHPCYPGVHCINLSPGFRCDACPVGFTGP 
MVQGVG I S F AKSNKQVCTD I DECRNGACVPNS I CVNTLGS YRCGPCKPGYTGDQ I RGC KV 
ERNCRKPELNPCSWAQCIEERQGDVTCVCGVGWAGDGYICGKDVDIDSYPDEELPCSAR 
NCKKDNCKYVPNSGQEDADRDGIGDACDEDADGDGIL^ 

DACDNCLSVLNMDQKDTDGDGRGDACDDDMDGDGIKNILDNCPKFPNRDQRDKDGDGVGD 



protein: 
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ACDSCPDVSNPNQSDVDNDLVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIG 
DECDDDDDNDGIPDLVPPGPDNCRLVPNPAQEDSNSDGVGDICESDFDQDQVIDRIDVCP 
ENAEVTLTD FRAYQTVGLD P EGDAQ ID PNWWLNQGME I VQTMNSD PGLAVG YTAFNGVD 
FEGTFHVNTQTDDD YAGF I FGYQD S S S F YWMWKQTEQT YWQ ATP FRAVAEPG I QLKAVK 
SKTGPGEHLI^SLWHTGDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYE 
GSELVADSGVTIDTTMRGGRLGVFCFSQENIIWSNXiKYRCNDTIPEDFQEFQTQNFDRFD 
N 
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